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Abstract — Recent research indicates that paclcet transmission 
employing random linear network coding can be regarded as 
transmitting subspaces over a linear operator channel (LOC). 
In this paper we propose the framework of linear operator 
broadcast channels (LOBCs) to model packet broadcasting 
over LOCs, and we do initial work on the capacity region 
of constant-dimension multiplicative LOBCs (CMLOBCs), a 
generalization of broadcast erasure channels. Two fundamental 
problems regarding CMLOBCs are addressed-flnding necessary 
and sufficient conditions for degradation and deciding whether 
time sharing suffices to achieve the boundary of the capacity 
region in the degraded case. 

Index Tenns — linear operator channel, network coding, broad- 
cast channel, capacity region, superposition coding, subspace 
codes 

I. Introduction 

Random linear network coding |[l] is an efficient alternative 
to achieve the network capacity proposed in |2|. In a random 
linear network coding channel packets are transmitted in gen- 
erations and are regarded as n-dimensional row vectors over 
some finite field F^. Due to the subspace preserving property, 
packet transmission over an acyclic noisy network may be 
thought of as conveying subspaces over a linear operator 
channel (LOC) |3 |, whose input and output symbols are taken 
from the set of all subspaces of F™ (referred to as "ambient 
space"). In [4 | Silva et al. investigated the capacity of a random 
linear network coding channel with matrices as input/output 
symbols. Later, by regarding a LOC as a particular DMC, 
Uchoa-Filho and Nobrega [5] studied the capacity of constant 
dimension multiplicative LOCs. Yang et al. JS), |[7| considered 
general non-constant multiplicative LOC capacity. In |8| the 
rate region of multiple source access LOCs was investigated. 

We will denote the set of all i-dimensional subspaces of F™ 
by 7'(F™, i). The following notation will be used in the sequel. 
Symbols X, Y and U denote random variables with values from 
subspace alphabets X, 2)' respectively il. The symbols X, Y 
and U denote subspaces in X, SI) and il, respectively. 

Constant-dimension multiplicative LOCs (CMLOCs) de- 
serve our interest, since they capture most packet transmission 
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scenarios. A precise definition of CMLOCs from the informa- 
tion theory point-of-view is the following. 

Definition 1. A constant-dimension multiplicative LOC (CM- 
LOC) of constant dimension I is a discrete memoryless 
channel (DMC) with input alphabet X = ViF"^,!) , out- 
put alphabet 2) = Ui=o^(^™'*) transfer probabilities 

piY\X) = py\x{Y\X) = p{Y - r|x = X) (X e X, Y e 2)) 
satisfying 



p{Y\X) 







if Y C Y, 
otherwise. 



(1) 



Here e^, < i < /, denotes the probability of receiving 
an «-dimensional subspace, and (')^ is the familiar Gaussian 
binomial coefficient. 

Our definition of a CMLOC is slightly different from that 
in |5|, where instead of e the rank deficiency distribution 
Pp{i)o<i<i (related to our distribution by Pp{i) = e;-0 occurs. 
In our case the total erasure probability is eo + ei + ' • • + £;-! = 
1 — e;, and e; is the probability of error-free transmission. The 
capacity of a CMLOC is given in 13] Th. 4]. 

As we know, only packet multicasting benefits from network 
coding and on the other hand multicasting at a constant 
rate would either starve receivers with high band-width or 
overwhelm those with a poor connection. This provides our 
motivation to investigate broadcasting over LOCs. 

Basic knowledge on broadcast channels can be found in 
f9l-nT|. Recent work showed that the computation of the 
capacity region of a discrete memoryless degraded broadcast 
channel is a non-convex DC problem lfT2l . Later Yasui et 
al. Ill] appUed the Arimoto-Blahut algorithm QH, |[l5l for 
numerically computing the channel capacity. 

The framework of general Linear Operator Broadcast Chan- 
nels (LOBCs) is presented in Section II with emphasis 
on constant-dimension multiplicative LOBCs (CMLOBCs), 
a generalization of the well-known binary erasure broadcast 
channel (BEBC). Two fundamental questions about CM- 
LOBCs are addressed: First, when will a CMLOBC be 
stochastically degraded? While for BEBCs the solution is quite 
obvious, for CMLOBCs the rich structure of possible erasures 



makes the problem quite intriguing. Our solution is discussed 
in Section III. Second, in the case of a degraded CMLOBC 
is time sharing sufficient to exhaust the capacity region? — 
for BEBCs the answer is "yes" and is again fairly obvious 
ifTel . In Section IV, we prove that for CMLOBCs this is not 
always true and further discuss the shape of the capacity region 
of CMLOBCs with subspaces taken from the projective plain 
PG(2, 2). Plenty of numerical analysis are shown on different 
cases of CMLOBCs over PG(2, 2) , via Arimoto-Blahut type 
algorithm in |fT3ll . Section V concludes the paper. Proofs can 
be found in the appendix (Section VII). 

II. Linear Operator Broadcast Channels (LOBCs) 
A. LOBC Module 

We consider the case of a multiple user LOC where a 
sender communicates with K receivers ui, U2,---,uk simul- 
taneously. The subchannels from the sender to Uk, k ~ 
1,2,...,K, are linear operator channels with input and out- 
put alphabets X,2) C \J^qP{P^ where m and q are 
fixed. Let X, Yi, . . . , Yfe be the corresponding random vari- 
ables. The output at every receiver is taken subject to some 
joint transfer probability distribution p(Yi, I2, ■ • • , yfel-'^) = 

PYi,Y.....,Y.|x(n,r2,...,n|X) = p(Yi = ri,Y2 = 

Y2, . . . , Yfe = YfelX — X). Such a channel is called Linear 
Operator Broadcast Channel (LOBC). For simplicity we re- 
strict ourselves to a LOBC with two receivers and let 9Jti, 
9H2 be the alphabets of private messages for user ui and U2, 
respectively. 

Deflnition 2. A broadcast (multishot) subspace code of length 
n for the LOBC consists of a set £ C X" of codewords 
and a corresponding encoder/decoder pair The LOBC encoder 
7: dJli X dJl2 — > £ maps a message pair (Mi,M2) to a 
codeword X — {Xi, . . . , Xn) G £ (for every transmission 
generation). The LOBC decoder S = {61,62} consists of two 
decoding functions 6i: 2)" ^ 2Jlj = I72) and maps the 
corresponding pair (^1,12) S ?)" x 2)" of received words to 
the message pair (Mi^Ah) = {6iiYi), 621X2)) 

The rate pair (i?i,i?2), in unit of g-ary symbols per sub- 
space transmission, of the broadcast subspace code is defined 
as 



R^ = 



log, 



i?2 = 



iog„ m2 



(2) 



n n 
As in |l9] Ch. 14.6] we can rewrite the encoding map as 

7: (i,2,...,q"^i)x(i,2,...,g"«=)^e: 

and associate with the broadcast subspace code the parameters 

Definition 3. A rate pair (i?i,i?2) is said to be achievable 
if there exists a sequence of {{q"^'^ , q^-^'^),n) broadcast 
subspace codes, for which the corresponding probabilities 
Pn = Pn{Mi 7^ Afi V A/2 7^ M2) of decoding error satisfy 
Pn ^ when n — > ooQ The capacity region (or rate region) 

'Here we tacitly assume that n runs through some subsequence of the 
positive integers for which all numbers , aj-g integers. 



of a LOBC is defined as the closure of the set of all achievable 
rate pairs. 

B. CMLOBCs 

If every subchannel in a LOBC is a CMLOC (necessarily 
with the same I, cf. Def. [TJ, we call it a constant-dimension 
multiplicative LOBC (CMLOBC). For CMLOBCs with ambi- 
ent space F™ and constant dimension I the normalized rate 
pair (_Ri, R2) can be defined in accordance with (|2|i as 



Ri 



Imn 



R2 = 



log, ^2 

Imn 



(3) 



By the principle of time division, it is clear that the capacity 
region of a CMLOBC should be at least the triangle area with 
three corner points-(0, 0), (C2, 0) and (0, Ci) on the R2) 
plane, where Ci refers to the channel capacity of X — ^ Y^, and 
all points {Ri,R2) satisfy R1/C1 + R2/C2 = lAi?i,i?2 > 
constitute the so called time sharing line. 

III. Degradation Theorem for CMLOBCs 

The following definition of degraded broadcast channels is 
taken from Itllj. 



Deflnition 4. A CMLOBC with transfer probabilities 
p{Yi,Y2\X) is said to be (stochastically) degraded if the 
conditional marginals p{Yi\X), p{Y2\X) are related by 
p{Y2\X) ^ T,Y^Pi^i\^)p'i^2\Yi) for some conditional dis- 
tribution p'{Y2\Yi). 

From Def. [T] it is obvious that CMLOBCs with (m, q, I) = 
(2,2,1) (the smallest nontrivial examples) are equivalent to 
ternary erasure broadcast channels with erasure probabili- 
ties Eq , Eq^'' for the two subchannels. Like a BEBC such 
broadcast channels are always degraded. In general, however, 
CMLOBCs are not degraded. Theorem [T] in this section gives 
a necessary and sufficient condition for a CMLOBC to be 
degraded. For its proof we need several lemmas. 



Lemma 1. Let e^^^ 



(4- 



(2) (2) 



.,ej ) be probability vectors. Then the following 



(1) ji) 
' *i ' 



, e|^'') and e 



(2) 



two statements are equivalent: 



1=0 ^3 



(1) 



(i) E 

( ii) There exists a lower triangular stochastic matrix A = 
such that e^^^A = e^^). 



Proof: See Appendix I VII- Al ■ 
For < /, s < TO let Vis be the incidence structure "Z- 
dimensional vs. s-dimensional subspaces of F™ with respect 
to set inclusion". Relative to suitable orderings of the input and 
output alphabet, the channel matrix of the CMLOC of constant 
dimension I with probability vector e = (eo, ei, . . . , e;) can be 
partitioned as 



S — (eoS(o I eiS;i 



(4) 



where Sis ("stochastic incidence matrix" of Vis) denotes an 
appropriate scalar multiple of the incidence matrix of Vis, 



determined by the requirement that Sjs be a (row) stochastic 
matrixQ 

Lemma 2. For integers l,s,t S {0, 1, . . . , m\ with I > s > t 
we have SuSgt = Su. 



Proof: See Appendix IVir^ ■ 
A CMLOBC with subchannels having channel matrices 
S(i)^ S*^^^ is degraded if and only if there exists a stochastic 
matrix T e M^x^^ (where M = Y.[=o (T),) such that 
g(2) ^ s(i)rp ^ggg p j4 Partitioning S^^), S^^) as in 
(HI and T accordingly, we can write this as 



-I 3/1 



/Too 
Tio 



Toi 
Til 



Toi\ 
Toi 



VT;o Til 



(2)< 



(5) 



With these preparations it is possible to prove 



Theorem 1. Let e^^^ and e^^^ be probability vectors associated 
with the two subchannels X — > Yi and X — >■ Y2, respectively, 
of a CMLOBC with ambient space F™ and constant dimension 
I < m. The CMLOBC is degraded (in the sense that Y2 is a 
degraded version of Yi) if and only if e*-^-* and e^^-* satisfy 



(6) 



Proof: See Appendix IVira ■ 
The excluded case / = to is indeed exceptional: In this case 
there is only one input subspace, so that the channel matrices 
reduce to probability vectors s'^^',s'^^' of length ill, where 
M = X]i!lo (T)g '■^^ '■^'■^^ number of subspaces of F™. 
However any two probability vectors s^^^s^^^ are related by 
s(i)X = §(2) for some stochastic matrix T of the appropriate 
size. (The matrix T — js'^^, where j is the all-one column 
vector of the same dimension as s, does the job.) This shows 
that in the case / = m the broadcast channel is degraded for 
all choices of e'^^, e'^^\ 

Corollary 1. Under the assumptions of Th. [7] suppose that 
e'^) and e'^^ satisfy 



g(i)<g^^j for every i G {0, 1, 2, 1}. (7) 

(and consequently e^' > ep''j Then the CMLOBC is degraded 
(in the sense that Y2 is a degraded version ofYi) . 



^The scaling factor for Dj^ is (')^ . 

^Note that <6) can be rewritten as J2''j=i <:j^' > 4^' for < « < « 

and implies in particular that the probabilities of successful transmission are 



related by ej^' > e 



(2) 



IV. The Capacity Region of Degraded CMLOBCs 
OVER THE Projective Plane PG(2, 2) 

A. Degraded CMLOBCs over the Projective Plane PG(2, 2) 

Let g = 2, TO = 3, Z = 2 and p{Yi\X), i = 1, 2, be defined 
through the channel matrices 



S(0 



El 321 



^2 



(8) 



where J7X1, Itx? denote the all-one, respectively, the identity 
matrix of the indicated sizes and S21 is a stochastic incidence 
matrix of 2-dimensional vs. 1-dimensional subspaces of Fj 
(in other words, an incidence matrix of the smallest projective 
plane PG(2, 2)). For example, we can take 



S21 



/ 1 


1 





1 








\ 


1 





1 





1 











1 


1 








1 











1 


1 








1 





1 








1 





1 


1 














1 


1 


V 








1 


1 


1 


; 



(9) 



By Th. [U the CMLOBC is degraded if and only if ej,^^ < 



.(2) A .(1) 



A e 



Q 



+ El ^ Co 



f 2 J. 

€l ^ or, equivalently, €q''' < 6q 



4^^ > 

Taking into account symmetry properties and keeping in 
mind the example of binary symmetric broadcast channels 
discussed in 19] Ch. 14.6], one might conjecture that the 
boundary of the rate region is obtained by taking the joint 
distribution p{U,X) which arises from a 7-ary symmetric 
channel U X and the uniform input distribution on il. This 
one-parameter family of distributions can be written in matrix 
form as 



1 



7a_^ 

6 / 



17x7) 



< CT < f. 



(10) 



Lemma 3. For the degraded CMLOBCs described by (|8]l, 
let p[j x(U, X) be chosen as in ( II Oi l, and with i?i(cr) — 
/(X,Yi|U), R2i<j) = /(U,Y2) let T = {{R,{a), R2{a)) \ 
o G [0,6/7]}. Then the curve V, considered as a function 
R2 = f{Ri) is defined on [0, Ci], strictly decreasing, and 
satisfies /(O) = C2, f{Ci) = 0. Further we have: 

(i) f is strictly concave (Ci) when ei^''e2^'' > ('^i^^2^\' 

(ii) f is strictly convex (U) when ei^'e2^'' < ^^^■'£2^''; 

(Hi) f is linear (i.e. T coincides with the time-sharing line) 
when e^e'^) = ^e^. 



Proof: See Appendix IVlLDi ■ 

Remark 1. If Case (i) holds for a degraded CMLOBC, then 
there exist superposition coding schemes which are superior to 
time sharing with respect to channel throughput. On the other 
hand, the family of joint distributions ( fTOl i does not necessarily 
determine the boundary of the capacity region. In particular 
we cannot conclude that in Case (ii) or (iii) of Lemma [3] the 
boundary is the time-sharing line. 



B. Numerical Analysis 

In each figure about the capacity region of some CMLOBC, 
we use a "filter" to delete points located below the time 
sharing line on the rate region plane and we always display 
two subfigures "before filter" and "after filter" at the same 
time. All the figures have enough pixel information to allow 
enlarging details. Relevant M-files can be found at ifTTl . Our 
analysis was done using MATLAB on a Linux system. 

Example 1. Let e^^' = (0.05,0.24,0.71), e^^) = 
(0.30,0.15,0.55). Then the condition of Case (i) is satisfied. 
Numerical results obtained by using the Arimoto-Blahut type 
algorithm from |13 | are shown in Fig.[Tl 



Capacity Regiaa of CMLOBC Example 1.1 Capacity Flegicn ct CMLOBC Exampie 1.1 alter Rter 




(a) before filter (b) after filter 

Figure 1: Capacity region of Example 1, e^^^ 
(0.05,0.24,0.71), e(2) = (0.30,0.15,0.55). 



Example 2. Let e^i) = (0.05,0.20,0.75), e^^) = 
(0.30,0.15,0.55), the condition of case (ii) is satisfied. Nu- 
merical results are shown in Fig. |2] indicating that time sharing 
might be sufficient to exhaust the capacity region. 




(a) before filter (b) after filter 

Figure 2: Capacity region of Example 2, e^^^ 
(0.05,0.20,0.75), e(2) = (0.30,0.15,0.55). 



Examples. Let e^^^ = (p^, pi, l-pi-p^), e^^) = (p^,P2,l- 
/92— P2)' where < pi < P2 < (— 1+\/5)/2. This corresponds 
to Case (ii). Numerical results are shown in Fig.|3] for the 
particular case pi = 0.1, p2 = 0.3 indicating that time sharing 
might be sufficient to exhaust the capacity region. 

Example 4. let q ^ 2, m ^ 3, I ~ 2, and define e^^^ — 
(0, pi, 1 - pi), e(2) = (0, p2, 1 - P2). with < pi < P2 < 1, 
the condition of case (ii) is satisfied. Numerical results are 




(a) before filter (b) after filter 

Figure 3: Capacity Region of Example 3, e^^-* = 
(0.01,0.1,0.89), e(2) = (0.09,0.3,0.61) 

shown in Fig. |4] for the particular case pi ~ 0.1, p2 — 0.3 
indicating that time sharing is suffice to exhaust the capacity 
region. 




(a) before filter (b) after filter 

Figure 4: Capacity Region of Example 4, e*^^^ = (0,0.1,0.9), 
e(2) = (0,0.3,0.7) 

C. A Conjecture on the Convexity of Capacity Region 

Overall the analysis supports the conclusion that super- 
position coding on CMLOBCs has no benefit over simple 
time-sharing unless we are in Case (i). However, proving the 
conjecture in full generality seems to be difficult. 

Conjecture. For the degraded CMLOBCs described by (|8]l, 
the capacity region is strictly concave (f]) if and only if 

^(i),(2r /2) (?) J y .1 

D. A special example-The 7-ary erasure broadcast channel 

Example 5. Let e^^^ = (pi,0, 1-pi), e^^) = (p2,0, 1-P2), 
where < pi < p2 < 1- Then the condition of Case (iii) is 
satisfied. Since (apart from unused output subspaces) there is 
now only one erasure symbol (the output subspace {0}), the 
subchannels of the CMLOBC become 7-ary erasure channels. 

The capacity region of this broadcast channel, more gen- 
erally of any CMLOBC with e^*) = (p,, 0, 0, 1 - pi) for 
i = 1,2, where < pi < /32 < 1, is determined by the next 
theorem. For the proof we need the following lemma. 

Lemma 4. Let U, X and Y be random variables with al- 
phabets ii, X and 2}> respectively, forming a Markov chain 
U — > X — > Y. Suppose that X ^ Y is described by 



VII. Appendix 



= ( pJ|i-|xi I (1-P)l|;e|xi3t| 
Then we have the relationships 

/(U;Y)-(l-p)/(U;X), 
/(X;Y|U) = (l-p)/(X;X|[/). 



(11) A. Proof of Lemma\l] 



(12) 
(13) 



This follows from linearity of mutual information with 
respect to the decomposition ( fTTT i and /(X; Y|U) = /(X; Y) - 
/(U;Y). 

Theorem 2. Suppose that the two subchannels of a CMLOBC 
are described by 



S« = (p«J|x|xi I (l-p,)I|3e|x|3e|), 



(14) 



where X = 7^(F™, I) and < pi < p2 < 1- Then its capacity 
region is the set of all pairs of {Ri, R2) satisfying Ri,R2 > 
and 

^ + < 1 (15) 



Proof: Suppose first that (ii) holds. Postmultiplying the 
equation e'^^^A ~ e*^^' by the matrix 



/ 1 
1 



V 1 1 






1 / 



(16) 



we obtain e^^'AL = e^^'L. The matrix A = AL = {6ij) 
is lower triangular with entries Sij < 1. (This follows from 

Sij = J2k=o ^ikhj — J2k=j < 'l2k=o '^i^ ^ ^-^ Hence 
we have 



(e(2)L) 



E 



.(1) 



.(1) 



(o<j<0- 



Then 



Proof: See Appendix IVII-EI ■ 

V. Conclusion 

In this paper, we have set up the framework of linear 
operator broadcast channels. We characterized degraded CM- 
LOBCs by a set of inequalities for their associated probability 
vectors. Necessary and sufficient conditions for a CMLOBC 
being degraded were obtained. The work on CMLOBCs over 
PG(2,2) shows that time sharing schemes do not always 
exhaust the capacity region. 

We conclude with some open problems arising from our 
work. 

• In the case of more general CMLOBCs (i.e. less noisy, 
more capable), whose rate region is not exhausted by 
superposition coding, investigate whether other coding 
technologies (dirty paper coding, etc.) are suitable for 
approaching the boundary. 

• How does the rate region of additive LOBCs or even more 
general LOBCs look like? The example of the binary 
symmetric broadcast channel suggests that in the generic 
case the nontrivial boundary curve R2 — f{Ri) is given 
by a strictly concave (n) function. 

• Construct good (multishot) superposition subspace codes 
for degraded LOBCs in the case, where rate splitting is 
needed to approach the boundary of the rate region. 
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E' 

i=0 



.(1) 



E 



e^^ < 1 



E4^' = E 



.(2) 



which implies (i). 

Now suppose that (i) holds. First we consider the special 
case where e^^^ and e^^) ^re related in the following way: 
There exist < i < j < I and a real number < A < 1 such 



that ) = eP' 



.(2) 



(1-A)4 



(1) 



and e^"'"'' ~ e"^^ for 



fee {0, 1, ...,/} \ {i, j}. In this case we have e^^'A = e(2), 
where A differs from the identity matrix only in the submatrix 
corresponding to rows and columns No. i, i + 1, . . ., j. The 
corresponding submatrix of A is 



/I 



1-A/ 



(17) 



so that A is clearly lower triangular and stochastic. In general, 
as is easily proved by induction, a new e(2) can be updated 
from e'^^ and last e'2) by a sequence of transformations of 
the above form (i.e., add A times the j-Xh component to the 
i-th component and subtract it from the j-th component for 
some < i < j < / and < A < 1). Since the set of 
lower triangular stochastic matrices is closed under matrix 
multiplication, the result follows. ■ 

B. Proof of Lemma\2\ 

Proof: Working with the ordinary incidence matrices T)is, 
Dst, D/t, the (i,j)-entry of DjsDst is equal to the number 
of subspaces V G 7^(F™,s) satisfying Ui ^ V ^ Wj, where 
e 7'(F'",0 and Wj £ V{¥"\t) denote the i-th resp. j-th 



subspace in the given ordering on 7'(F™,/) resp. 7^(F™,t). 
Thus 



st ) ij 







(18) 



if D Wj, 
if ^ Wj. 

This shows that HisGst = {[7_\) Tin is a scalar muhiple of 
Djf. Obviously we then also have SjsS^t = AS;t for some 
scalar A. Since Sis^st as well as S;t are stochastic, we must 
have A = 1, proving the lemma. ■ 

C. Proof of Theorem\J\ 

Proof: Suppose first that Condition ^ is satisfied. In (|5]l 



we choose T, 



XijSij with Xij e 



(where it is understood 



that Sij — whenever i < j). Using Lemma |2] we obtain 



S(i)t 



/AooSoo 
AioSio AiiSii 



00 



(e«An+erA 



(1). 



10, 



.(1) 



(1) 



(19) 



(20) 



By Lemma [T] we can further choose A = (A^j ) as a lower 
triangular stochastic matrix satisfying e'^^'A — e^^\ Then the 
resulting matrix T = (AySij) is stochastic and satisfies (|5]). 
Hence in this case the broadcast channel is degraded. 

Conversely suppose the broadcast channel is degraded, so 
that (|5]) holds for some stochastic (block) matrix T = (Ty ). 
First we will show that we can assume (without loss of 
generality) that Ty — for i < j. ^ says 



1=0 



e,P(SHTy) = ef for < j < Z 



If ep-* — then we can replace each block T^j, Q < j < I, 
by the corresponding all-zero matrix. Hence the assertion is 
true in this case. On the other hand, if e^' > then every 
positive entry in S/iTy forces a positive entry of S/j in the 
same position. Now suppose Tij has a nonzero (i.e. positive) 
entry in a position indexed by some subspaces V E 7^(F™, i), 
W G V(¥^,j). Then SuTij has a positive entry in each 
position indexed by the same subspace W (as a column index) 
and any subspace U E 7'(F™,Z) which contains V (as a row 
index). 

If ? < j then we can find a subspace U E "PCF™, I) which 
contains V but not W. This can be seen as follows: The space 
W — {W + V)/V is a nonzero subspace of F™/y. Hence 
there exists a subspace U of F™/1/ of dimension l — i< m — i 
which does not contain W. Then the preimage [/ of [/ in F™ 
has the required property. (The assumption Z < m is essential 
here!) 

Since U contains V but not W, the matrix SuTij has an 
entry > in the position corresponding to ([/, W) and S;^ has 



a zero in this position. This contradiction shows that e]^^ > 
implies Tij = for i < j, so that from now on we can indeed 
assume = for all i < j. 
Now we postmultiply (|5]l by 



/Soo 
Sio 





Sii 







(21) 



\Sio Sii ••• SiiJ 

Using Lemma |2] on the left-hand side and setting A = TL 
(Aij) on the right-hand side we obtain 



^,(1) 

i=3 



e- 'SliAij 



i=3 



So (o<j<0- 



(22) 



Applying these matrix equations to the all-one column vectors 
j of the appropriate dimensions gives, in view of Sjj j = j and 
{SuA.ij)i = Sii(Ayj) < Snj = j, the required inequalities 



proof of the theorem. 



— ^^i^ < j 1^ I), which completes the 



D. Proof of Lemma\3\ 

Proof: During the proof we write Y^'^ i = 1,2, for 
the subchannel outputs (here Y^*^ corresponds to the prob- 
ability vector e*^*') and Y^, s — 0,1,2, for the dimension s 
component of Y^*-' (corresponding to the s-th block in the 
decomposition ([8])), which is independent of i. We will use 
the (easily established) fact that mutual information is linear 
in the following sense: 

/(X;Y«|U)=^eW/(X;Y,|U), 



/(U;Y(2))=^42)^(U;Y,), 

s=0 

which generalizes to arbitrary decompositions of the form (01). 

Clearly /(X;Yo) /(U;Yo) = 0. The (symmetric) chan- 
nels X — > Yi, X — > Y2, U — > Y2 have channel matrices 
S21, Itxt^ §"^7x7 + (1 ~ ■'■7x7' respectively. The channel 
U — >^ Yi has channel matrix 

(f J7x7 + (1 ^ I7X7) S21 = f J7X7 + (1 ^ S2I 

The input distribution on it (and hence the distribution on X 
as well) is uniform, this gives 

i?2(fT)=/(U;Y(2)) 

= (-i7(f)+log|-f log|) 

+ e^^^ (-i? ((t) + log 7 - (t log 6) , 

i?i(a)=/(X;Y(i)|U) 



= eW (i/(f) + f log 



+ (i7(a)+ a log 6), 

where iJ(a;) = — x log a;— (1 — x) log(l — a;) denotes the binary 
entropy function. To simplify the expressions below, we will 



take log as the natural logarithm, for which H'{x) — log 
H"{x) = We have further 

ril / \ (2) / 2 1 l-2cr 
=4 ("llog^ 



^(2) 
^2 

J2)2 
^1 3 



log- 



-2cr/3 

/3 
1 - (7 



I log I 



log 



2(l-2g/3) 



,(1)2 



log 



2(l-2a-/3) 



log 6 



(2) , 6(1-0-) 



^1 3^"B a r-c2 

From this one verifies at once that R'i{(y) > 0, R'2{o') < for 
< < I (and = = 0). Hence, by results from 

standard calculus, / is well-defined and /'(_Ri(cr)) 



RUa) 



0, so that / is strictly decreasing. Moreover, since i?i(0) = 0, 
i?2(0) = e('hog| + e^'hog7 = C2, i?i(f) = el'^ log | + 
4'^ log 7 = Ci, i?2(f) = 0, we have /: [0, Ci] ^ [0,^2], 
/(O) = C2, and /(Ci) = 0. 

In order to decide whether / is convex/concave/hnear, we 
use the second derivative test from standard calculus. We have 
to determine the sign of 

I ,x R'i{cr)R{{a) ~ R'i{cj)R'^{a) 

f {Rii'y)) = 

for cr e (0, |), which is the same as the sign of 

R'^{a)R[ia) ~ R'l{a)R',{a) = 



(2) (1) J1)J2). 



cr(l - cr)(3 - 2(7) 

-(l-f)logHilzXl 



(l-a)log 



6(1-0) 



(23) 



It may be verified that the right-hand factor 

g{a) = (1 - a) log(l - a) - (l - f ) log (l - f ) 
+ iCTl0g(T+ (1 -CT)log6- (l-^)log2 
satisfies .g(0) = g(|) = and 

^»-- .(l-.)'(3-2cx) <" f-0<.<f, 
from which it follows that g{a) is positive in (0, |). Hence the 
sign of /" in (0, |) is constant and equal to that of ei^''e2^'' — 
. This concludes the proof. ■ 

E. Proof of Theorem\2\ 

Proof: It is clear from Lemma |4] that the capacities of the 
subchannels are Q = (1 — pi) log |X| (i = 1, 2). Further, for 
an arbitrary joint distribution p{U,X) Lemma |4] gives 

C2/(X;Yi|U) + Ci/(U;Y2) 

= (l-p2)log|X|(l-pi)/(X;X|U) 
+ (l-pi)log|X|(l-p2)/(U;X) 
C1C2 



log|X| 
C1C2 



/(X;X) 
H{X) < C1C2, 
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